METHOD OF PRODUCING HUMAN GROWTH FACTORS FROM
WHOLE PLANTS OR PLANT CELL CULTURES

FIELD OF THE INVENTION 

The present invention relates generally to a method for producing human
growth factors from whole plants or plant cell culture. More specifically, the
invention relates to producing a human growth factor with a length of at least
200 amino acids from transgenic plant cells encoded to produce that growth
factor.

BACKGROUND OF THE INVENTION

Growth factors and monoclonal antibodies (Mabs) are diverse yet highly 
specialized types of proteins having research and commercial applications in areas 
of therapeutics and diagnostics. 

Therapeutic uses of human epidermal growth factor (hEGF) include treatment
of soft tissue wounds (U.S. 5,218,093, 1993), specifically including skin and eye
injuries as well as corneal and stomach ulcers (Frost and Sullivan 1996, 1994). In
addition, several hEGF-bearing fusion constructs have been considered and/or
tested, including mitotoxins for treatment of restenosis (Frost and Sullivan, 1994)
and radioconjugates for a variety of antineoplastic therapies (Grieg et al., 1988).

Current production techniques for these proteins, such as hybridoma and other
types of mammalian cell culture methods (Kohler and Milstein, 1975), are generally
slow, labor intensive, and consequently expensive. In addition, current
production techniques are difficult to validate due to the pathogenic and oncogenic
potential of cultivated mammalian tissue.

Multimers of from 2 to 7 EGF units, each having 53 amino acid residues,
have been produced from bacterial hosts, e.g., E. coli, Streptomyces and Bacillus;
fungal hosts, e.g., Saccharomyces, Pichia and Aspergillus; insect cell hosts; and
mammalian cell hosts, e.g., CHO cells and COS cells (U.S. Patent No. 5,218,093,
1993). hEGF production in Staphylococcus aureus (U.S. Patent No. 5,004,686,
1991) is by a fusion construct encoding hEGF linked to a protein. Synthesis
methods using transgenic bacterial strains have problems such as faulty antibody
gene expression, protein folding difficulties, inability to glycosylate proteins, and
relegation of foreign peptides to insoluble material accumulated in inclusion
bodies.

Transgenic plants can be used for the production of high value, medicinally 
important proteins, for example, production of Mabs (Hiatt et al., 1989; During et 
al., 1990; Benvenuto et al. 1991, Firek et al. 1993, Gao et al. 1993), human 
growth hormone (Kay et al. 1986) and human serum albumin (Sijmons et al. 
1990). Transformed cells synthesize, secrete, and accumulate functional 
antibodies including single (Benvenuto et al. 1991) and double (During et al. 1990, 
Hiatt et al. 1991) domain immunoglobulins. However, none of these authors
investigated production of any human growth factor from transgenic plants.

Plant cell culture media are well-defined and inexpensive compared to
mammalian cell culture media. Further, plant cell products, unlike
mammalian-derived protein formulations, are generally assumed to be neither
pathogenic nor oncogenic to humans (Crawford, 1995). Also, when compared to
similar production in transgenic bacterial strains (Attaai and Shuler 1987), plant
tissue culture methods showed greater stability of foreign gene expression, even
without use of selection pressure (Gao et al. 1991). Higo et al. (1993) produced a
human growth factor, specifically hEGF, in transgenic tobacco with a cDNA
fragment size of 180 bp. Unsatisfactory foreign peptide levels of 20 to 60 pg/mg
(ppb) total soluble leaf protein were obtained, even though plant progeny
appeared to produce high levels of hEGF mRNA. Exact reasons for the low observed
levels of hEGF production are unclear. However, no signal peptide was encoded
upstream of the hEGF cDNA, which could cause the foreign protein to be relegated
to the cytosol. Within this cell fraction, hEGF suffers proteolytic attack,
especially considering the relatively small size (53 amino acids) of the peptide.

Although advantages have been observed for deriving proteins including EGF
from plants, no transgenic plant cell culture process has been commercially
developed for production of human growth factor. The lack of commercial
exploitation of plant derived proteins is due in part to existing technological
hurdles as observed by Higo et al. In addition, Ma et al., 1995 reported Mab
titers of up to 500 µg/g (ppm) fresh weight of plant material (or 300 mg/L on a
cell culture basis) whereas comparable mammalian cell processes are reported to
attain levels of 1-2 g/L and higher (Rosenberg, personal communication, 1995).
Implementation of alternative production systems to mammalian and bacterial
culture, such as plant cellular techniques, has been further limited by
non-technological factors, such as industry and regulatory acceptance (Simonsen and
McGrogan, 1994) because of the investment made in developing and validating the
more established non-plant methods.

Accordingly, there is a continuing need for plant based production of human 
growth factors. 

Background References 

SUMMARY OF THE INVENTION

Despite the hurdles in technology development and commercialization,
economic analysis indicates that regulatory costs associated with plant cell culture
may be reduced by as much as $70,000 per batch as compared to analogous
mammalian cell processes (Crawford, 1995). In addition, direct production costs
for whole plant processes at equal protein production rates appear to be two to
four orders-of-magnitude lower than comparable mammalian cell processes
(Agracetus 1995). Additionally, as plant cell titers increase, this type of
production becomes even more capital cost-effective.

It is, therefore, an object of the present invention to provide whole plant and
plant cell culture derived human growth factors at higher overall concentrations
and production rates, comparable to mammalian host cell systems.

It is a further object of the present invention to synthesize specific human 
growth factors. 

It is another object of the present invention to increase production rates and
concentrations by increasing protein stability through the use of fusion constructs.

It is a further object of the present invention to use Nicotiana tabacum
(tobacco) and Solanum tuberosum (potato) whole plants and highly synchronous
suspensions.

According to the present invention, the production of human growth factors
is achieved in whole plants or plant cell culture wherein the human growth factor
is produced with a length of at least 200 amino acids. For epidermal growth
factor this would comprise at least a tetramer of EGF units.

Modifying chimeric cDNA and subcloning into a plant expression vector are
done using standard molecular cloning procedures (Ausubel et al. 1992) and
splicing PCR techniques (Marks et al. 1992).

The effectiveness or productivity of the translation process has been increased
according to the present invention by (1) cloning of pre-pro-EGF cDNA of
approximately 4.5 kb into both whole plants and cell culture to increase overall
titers of active hEGF, (2) synthesizing cDNA and transforming plants and cell
culture for production of an oligomeric polypeptide consisting of repeated hEGF
domains, and (3) increasing the overall size of the gene to be expressed with a
fusion construct encoding hEGF linked to a protein that is efficiently produced in
plant systems. As needed, synthetic cDNA includes plant-specific proteolytic
cleavage sites between EGF repeats to facilitate correct processing in planta.
Appropriate proteolytic cleavage sites upstream and downstream of hEGF are
added if needed to obtain the final product.

The subject matter of the present invention is particularly pointed out and
distinctly claimed in the concluding portion of this specification. However, both
the organization and method of operation, together with further advantages and
objects thereof, may best be understood by reference to the following description
taken in connection with accompanying drawings wherein like reference characters
refer to like elements.



BRIEF DESCRIPTION OF THE DRAWINGS 



FIG. 1 provides the size of EGF precursor (pre-pro-EGF) relative to
correctly processed EGF.

FIG. 2 depicts schematically the construction of pZD203, a vector used to 
modify the restriction sites on pre-pro-EGF to develop cDNA suitable for cloning 
into the plant expression vector pGA643. 

FIG. 3 depicts schematically the construction of pZD204, the plant
expression vector carrying pre-pro-EGF.

FIG. 4 shows EGF levels seen in individual calli resulting from positive
transformation and antibiotic selection. EGF concentrations were determined
using enzyme-linked immunosorbent assay and are based on a 30 kD protein size.

DESCRIPTION OF THE PREFERRED EMBODIMENT(S)

The present invention is a method for production of human growth factors
using whole plants as well as plant cell suspensions transformed with appropriately
constructed vector plasmids, wherein the human growth factor is produced with a
length of at least 200 amino acids. More specifically, the method of the present
invention provides stable expression of human growth factors of interest as direct
therapeutics, targeted delivery systems and research reagents. Human growth
factors produced include human epidermal growth factor (hEGF), transforming
growth factor (TGF), vascular endothelial growth factor (VEGF), platelet-derived
growth factor (PDGF), fibroblast growth factor (FGF), tumor necrosis factor
(TNF), heparin-binding epidermal growth factor (HBEGF), insulin-like growth
factor (ILGF), platelet-derived endothelial cell growth factor (PDECGF),
platelet-derived angiogenesis factor (PDAF), and bone-and-cartilage inducing growth
factor (BCIF).

Any plant from the plant kingdom may be utilized. Specific types of plants
that are amenable to the transformation steps listed herein include, but are not
limited to, monocotyledonous, dicotyledonous, and tuberous plants. Preferred
species include, but are not limited to, Nicotiana tabacum (tobacco), Solanum
tuberosum (potato), Glycine max (soybean), and Zea mays (corn).

The method of the present invention, a method of producing human growth
factors from plant cells, has the steps of:

(a) obtaining a positive transformant of the plant cells, the positive
transformant carrying genetic material encoding the production of a human growth
factor with a length of at least 200 amino acids;

(b) cultivating the positive transformant; and

(c) obtaining the human growth factors.

The step of obtaining the positive transformant may be as simple as purchasing
it, or may involve actually making it by well known methods, for example direct
particle bombardment as described in Gene Transfer by Particle Bombardment,
Klein TM, Knowlton S, Arentzen R, Plant Tissue Culture Manual, D1, pp 1-12, 1991,
Kluwer Academic Publishers, or by Agrobacterium mediated transformation as
described in Hoekema et al. 1985 (Hoekema KM, Hirsch PR, Hooykaas PJJ,
Schilperoort RA, 1985, Nononcogenic Plant Vectors for Use in the Agrobacterium
Binary System, Plant Molecular Biology, Vol. 5, 85-89), and further described
herein.

The step of cultivating involves either whole plant cultivating or tissue
cultivating by any of the well known cultivating methods.

The step of obtaining the human growth factors is by well known separation and
purification steps, for example ultrafiltration, affinity chromatography, and/or
electrophoresis.

An Agrobacterium mediated transformation method of the present invention
has the steps of:

(a) modifying chimeric cDNA encoding a specific growth factor for
subcloning into a plant expression vector;

(b) subcloning the chimeric cDNA into the plant expression vector;

(c) transferring the plant expression vector containing transgenic plant
cells to an Agrobacterium;

(d) co-cultivating a portion of the transgenic plant cells (suspension
culture or leaf disks) with the Agrobacterium;

(e) selecting positive transformants from the co-cultivated culture on
an antibiotic selective media;

(f) permitting growth of the transgenic plant cells in whole plants or
suspensions; and

(g) extracting a liquid containing the human growth factor;

wherein the improvement comprises:

said human growth factor having a length of at least 200 amino acids.
Modifying chimeric cDNA and subcloning into a plant expression vector are
done using standard molecular cloning procedures (Ausubel et al. 1992) and
splicing PCR techniques (Marks et al. 1992). More specifically, modifying
chimeric cDNA has the steps of:

(a) adding a transcription promoter to the upstream or 5' end of
the chimeric cDNA; and

(b) adding a transcription terminator to the downstream or 3' end
of the chimeric cDNA.

The transcription promoter and the transcription terminator are regulatory
elements.

Further, an additional regulatory element encoding a signal peptide may be added
between the transcription promoter and the 5' end of the chimeric cDNA in order
to target the human growth factor product to a specific cellular organelle. In
addition, other regulatory elements may be added either between the promoter and
the additional regulatory element encoding the signal peptide or at the 3' end of
the chimeric cDNA to obtain greater mRNA stability between transcription and
translation events.
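
By way of illustration only, the 5' to 3' ordering of elements described above may be sketched as a small data structure. The class name `ExpressionCassette`, its field names, and the example values are hypothetical labels chosen for this sketch, not names used by any cloning software; the optional 3'-end stabilizing element mentioned above is not modeled.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ExpressionCassette:
    """Hypothetical container for the elements flanking a chimeric cDNA."""
    promoter: str                          # added at the upstream (5') end
    cdna: str                              # chimeric cDNA encoding the growth factor
    terminator: str                        # added at the downstream (3') end
    signal_peptide: Optional[str] = None   # optional, between promoter and cDNA
    leader: Optional[str] = None           # optional, between promoter and signal peptide

    def five_to_three(self) -> List[str]:
        """List the elements that are present, in 5' to 3' order."""
        ordered = [self.promoter, self.leader, self.signal_peptide,
                   self.cdna, self.terminator]
        return [element for element in ordered if element is not None]


# Example values are illustrative assumptions drawn from elements named in the text.
cassette = ExpressionCassette(
    promoter="CaMV 35S promoter",
    cdna="hEGF concatemer cDNA",
    terminator="transcription terminator",
    signal_peptide="PR-S signal peptide",
)
print(" > ".join(cassette.five_to_three()))
```

Keeping the optional elements as separate fields makes it easy to compare constructs with and without a signal peptide or leader sequence.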

In either whole plants or cell cultures, to enhance expression of the chimeric
gene (hEGF), the present invention further includes manipulation of a 35S
promoter by duplication of the upstream region (-343 to -90 bp) of the CaMV 35S
promoter to increase transcription activity, as well as use of TSC29 and TSC40
promoters. These promoters and their transcription activity have been reported by
Gao et al. 1994, and Dai et al. 1995.

In whole plants, transcription promoters may include the upstream enhancer
(nucleotides -343 to -90 relative to the transcription start site) of the CaMV 35S
promoter (Benfey et al. 1989) or the chlorophyll a/b binding protein (cab1)
promoter (Ha and An 1988). Use of these types of regulatory elements confers
human growth factor production characteristics on traditionally non-salable
portions of crop plants, such as the leafy tops of potatoes. Use of potato tops, for
example, under post-harvest conditions, results in overexpression and production
of human growth factor in non-salable plant portions towards the end of the
harvesting season, without affecting crop quality.

Transferring the plant expression vector into the Agrobacterium is completed
using the freeze-thaw method (An 1987). For monocotyledonous species,
super-binary vectors, such as pTOK233 and pSB131, are used to achieve high
transformation frequency (Ishida et al. 1996). Remaining cocultivation, selection,
growth, and extraction steps (d through g) have been described by Magnusen et al.
(1996), and are well known in the art of plant molecular biology.

Many human growth factors possess relatively short lengths of between 50
and 100 amino acids. For example, hEGF has a length of 53 amino acids.
Accordingly, obtaining a larger construct of at least 200 amino acids requires
either (1) cloning the larger precursor cDNA, (2) synthesizing a concatemer
consisting of multiple gene copies encoding the growth factor, or (3) increasing the
overall size of a gene to be expressed using a fusion construct encoding a growth
factor linked to a protein that is efficiently produced in plant systems.
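
For the concatemer option, the number of tandem copies needed to reach the 200 amino acid threshold follows from simple arithmetic. The sketch below is illustrative; the function names `copies_needed` and `concatemer_length` are hypothetical, and the optional linker length is an assumption added to show how short peptide linkers change the total.

```python
def copies_needed(monomer_len: int, min_len: int = 200, linker_len: int = 0) -> int:
    """Smallest copy number n with n*monomer_len + (n-1)*linker_len >= min_len."""
    numerator = min_len + linker_len
    denominator = monomer_len + linker_len
    # Integer ceiling division avoids floating-point rounding.
    return max(1, -(-numerator // denominator))


def concatemer_length(monomer_len: int, copies: int, linker_len: int = 0) -> int:
    """Total residues in a concatemer of `copies` monomers joined by linkers."""
    return copies * monomer_len + (copies - 1) * linker_len


# hEGF is 53 residues, so four copies (a tetramer, 212 residues) are needed.
n = copies_needed(53)
print(n, concatemer_length(53, n))

# Growth factors at the ends of the stated 50 to 100 residue range.
print(copies_needed(50), copies_needed(100))
```

With a 53-residue monomer, three copies give only 159 residues, which is why a tetramer is the minimum for EGF.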

An example of obtaining a larger precursor to increase the overall protein
size is the cDNA encoding pre-pro-EGF. This particular gene, at approximately
4.5 kb, encodes a 1207 amino acid protein that, in vivo, is proteolytically cleaved
to yield 53 amino acid EGF. In plant systems, this larger protein will provide
additional stability against proteolytic degradation.
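
The relationship between the 1207 amino acid precursor and the approximately 4.5 kb cDNA can be checked with a short calculation. This is a sketch under stated assumptions: the cDNA length is taken as a round 4500 nucleotides, and the helper name `cds_length_nt` is hypothetical. The resulting coding length of 3624 nucleotides agrees with the numbering in SEQ ID NO: 1, where 62 nucleotides precede the ATG and the TGA stop codon ends at position 3686.

```python
def cds_length_nt(protein_len_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_len_aa + (1 if include_stop else 0)
    return 3 * codons


PRE_PRO_EGF_AA = 1207   # precursor length stated in the text
MATURE_EGF_AA = 53      # mature EGF length stated in the text
CDNA_LEN_NT = 4500      # assumption: "approximately 4.5 kb" taken as 4500 nt

cds = cds_length_nt(PRE_PRO_EGF_AA)
untranslated = CDNA_LEN_NT - cds
mature_fraction = MATURE_EGF_AA / PRE_PRO_EGF_AA

print(f"coding sequence: {cds} nt")
print(f"non-coding remainder: about {untranslated} nt")
print(f"mature EGF is {mature_fraction:.1%} of the precursor")
```

Mature EGF therefore accounts for only a few percent of the precursor, which is the basis for expecting the larger protein to be less exposed to proteolytic attack.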

Synthesizing the cDNA concatemer is preferably done by ligating multiple
gene copies using peptide linkers to obtain a processed protein length of at least
200 amino acids. The multiple gene copies preferably encode an oligomeric
polypeptide consisting of repeated growth factor domains. Peptide linkers may
be used that are (1) proteolytically cleaved in planta, (2) proteolytically cleaved in
a separate enzymatic treatment step, or (3) resistant to proteolytic cleavage.
Peptide linkers that are proteolytically cleaved by serine proteases in planta
preferably possess the amino acid sequence Arg-Asn. This sequence already exists
when EGF is concatemerized since the C-terminal amino acid is arginine and the
N-terminal amino acid is asparagine. To achieve in planta cleavage, the processed
protein is targeted either to the cell cytosol (no signal peptide) or vacuole
(phytohemagglutinin signal peptide [Chrispeels et al. 1991]). To achieve
proteolytic cleavage in a separate enzymatic treatment step, the same amino acid
sequence is preferably used (Arg-Asn) and the growth factor concatemer is either
targeted to the chloroplast (pea photosystem II signal peptide) or secreted (PR-II
signal peptide) to limit proteolytic degradation. To achieve resistance to
proteolytic cleavage, linkers would preferably possess the amino acid sequence
Arg-Pro. This sequence is resistant to serine proteases. Specifically for EGF,
linkage would preferably be achieved by synthesizing cDNA encoding a single
proline unit between growth factor monomer cDNAs.
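
The junction sequences described above can be illustrated with a short sketch. The 53-residue string `HEGF` is the mature EGF sequence as read from the EGF-coding codons of SEQ ID NO: 1 (one-letter amino acid code); the function names are hypothetical and the code is an illustration of the junction logic, not a cloning protocol.

```python
# Mature hEGF, 53 residues: begins with Asn (N) and ends with Arg (R).
HEGF = "NSDSECPLSHDGYCLHDGVCMYIEALDKYACNCVVGYIGERCQYRDLKWWELR"


def concatemer(monomer: str, copies: int, linker: str = "") -> str:
    """Join `copies` monomers head to tail, with an optional peptide linker."""
    return linker.join([monomer] * copies)


def junctions(monomer: str, copies: int, linker: str = "") -> list:
    """Dipeptide formed by the last residue of each monomer and what follows it."""
    seq = concatemer(monomer, copies, linker)
    step = len(monomer) + len(linker)
    last = len(monomer) - 1
    return [seq[i * step + last: i * step + last + 2] for i in range(copies - 1)]


# Direct concatemerization yields Arg-Asn (RN) at every junction.
print(junctions(HEGF, 4))

# A single proline between monomers yields Arg-Pro (RP) instead.
print(junctions(HEGF, 4, linker="P"))
print(len(concatemer(HEGF, 4)), len(concatemer(HEGF, 4, linker="P")))
```

A tetramer built this way has 212 residues without linkers and 215 residues with single-proline linkers, so both variants meet the 200 amino acid threshold.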

Increasing the overall size of a gene may be done by ligating EGF with
cDNA encoding a protective protein to protect from proteolytic cleavage, thereby
forming a fusion construct. Protective proteins include but are not limited to
streptococcal protein G or β-galactosidase, both of which have been shown to
inhibit proteolysis when attached to the C-terminus of other foreign proteins
(Hellebust et al. 1989). Gene size could also be increased by ligating EGF with
cDNA encoding another protective protein of commercial interest that processes
well in plant-based systems. Protective proteins further include human serum
albumin (Sijmons et al. 1990) and phytase (Verwoerd et al. 1995).

At least one genetic regulatory element may be included in the cDNA
encoding the transcription of specific growth factors. Regulatory elements include
transcription promoters or enhancers that increase the frequency of transcription
events, leader sequences that increase the stability of mRNA prior to translation,
and signal peptides that target proteins to specific organelles for posttranslational
modifications and accumulation. Examples of transcription enhancers include but
are not limited to the octopine synthase enhancer, a 16 bp palindrome
(ACGTAAGCGCTTACGT) (Ellis et al. 1987) and the B-domain of the
cauliflower mosaic virus 35S promoter (Kay et al. 1987). An example of a leader
sequence is the alfalfa mosaic virus RNA4 leader sequence (Jobling and Gehrke
1987). Examples of signal peptides include but are not limited to the tobacco
PR-S signal peptide (Cornelissen et al. 1986) and the phytohemagglutinin signal
peptide (Hunt and Chrispeels 1991).

Example 1 

The bacteriophage λEGF116 (ATCC No. 59956) containing the gene
encoding the full length polypeptide of human kidney pre-pro-EGF was obtained
from ATCC. Pro-EGF (FIG. 1) is the 1207 amino acid precursor in which hEGF
is flanked by polypeptide segments of 970 and 184 residues at its NH2- and
COOH-termini, respectively (Bell et al., 1986). The remainder of the 4.8 kb
pre-pro-EGF gene encodes native signal peptides at both the NH2- and COOH-termini
of pro-EGF. The polypeptide contains a transmembrane (TM) binding region that
facilitates proper cleavage in the endoplasmic reticulum.

WO 98/21348 PCT/US97/20603

The full length of cDNA was excised with Sma I, Hind III, and Eco RI
restriction enzymes, as shown in FIG. 2, producing two separate fragments.
These were sequentially ligated into compatible Sma I and Eco RI sites in
pBluescript- creating the 7.5 kb plasmid pZD203. After proper orientation was
confirmed, pre-pro-EGF cDNA was further excised with Xba I and Cla I
restriction enzymes and ligated into compatible sites located between the CaMV
35S promoter and T7 transcription terminator of binary vector pGA643, forming
the 16 kb plasmid pZD204 (FIG. 3). This plasmid was directly transferred into
Agrobacterium tumefaciens LBA4404 using the freeze-thaw method (An 1987).
The transferred plasmid was introduced into tobacco whole plants (by leaf disks)
and calli (by suspension culture) by co-cultivation with the Agrobacterium, thereby
producing transformants. Over 200 specific samples of transformants were taken
from the co-cultivation and separately placed on kanamycin selective media. The
co-cultivated transformants that grew were positive transformants. The positive
transformants were screened under kanamycin selection pressure, and preliminary
ELISA results indicated the presence of hEGF in tobacco calli. Accumulation
levels of hEGF in select transgenic calli are shown on a ng/g fresh weight basis in
FIG. 4. The bars in FIG. 4 represent a random sample of the specific samples of
transformants. The highest level of accumulation, approximately 400 ng/(g fresh
weight cells) (ppb), corresponds to a concentration of 4.1 ng/(mg total soluble
protein) (ppm) (based on a measured total soluble protein level of approximately
98 mg/(g fresh weight cells)). The 4.1 ng/(mg total soluble protein) (ppm)
corresponds to 4100 pg/(mg total soluble protein) (ppb), which is almost two
orders-of-magnitude greater than the result of 60 pg/(mg total soluble protein)
(ppb) reported by Higo et al. (1993).
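
The unit conversions in the preceding paragraph can be reproduced with the short calculation below. It is offered as an illustrative check only; the variable and function names are hypothetical, and the inputs are the approximate figures quoted in the text.

```python
import math


def ng_per_mg_tsp(ng_per_g_fw: float, tsp_mg_per_g_fw: float) -> float:
    """Convert accumulation per g fresh weight to ng per mg total soluble protein."""
    return ng_per_g_fw / tsp_mg_per_g_fw


hegf_ng_per_g_fw = 400.0   # highest accumulation, ng hEGF per g fresh weight
tsp_mg_per_g_fw = 98.0     # total soluble protein, mg per g fresh weight
higo_pg_per_mg = 60.0      # upper value reported by Higo et al. (1993)

ng_mg = ng_per_mg_tsp(hegf_ng_per_g_fw, tsp_mg_per_g_fw)  # ng/mg is parts per million
pg_mg = ng_mg * 1000.0                                    # pg/mg is parts per billion
fold = pg_mg / higo_pg_per_mg
orders = math.log10(fold)

print(f"{ng_mg:.1f} ng/mg total soluble protein")
print(f"{pg_mg:.0f} pg/mg total soluble protein")
print(f"{fold:.0f}-fold over 60 pg/mg, about {orders:.1f} orders of magnitude")
```

The improvement works out to roughly 68-fold, or about 1.8 orders of magnitude, consistent with the description "almost two orders-of-magnitude".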

Further ELISA and Northern blot analyses were used to detect high levels of
foreign protein production and mRNA transcription, respectively. Western blot
analysis, completed to determine protein size, showed that specific EGF bearing
constructs of 30 kD were produced. This size corresponds to approximately 250
amino acids.
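
The conversion from apparent molecular mass to residue count depends on the average residue mass assumed. The sketch below is an illustration under that assumption: a value of 120 Da per residue reproduces the figure of approximately 250 amino acids, while the commonly used rule of thumb of about 110 Da per residue gives a somewhat higher count. Neither average is stated in the text, and the function name is hypothetical.

```python
def residues_from_mass(mass_da: float, avg_residue_da: float) -> float:
    """Estimate residue count from protein mass and an assumed average residue mass."""
    return mass_da / avg_residue_da


MASS_DA = 30_000.0  # 30 kD band observed by Western blot

# Both averages are assumptions used only to bracket the estimate.
for avg in (110.0, 120.0):
    estimate = residues_from_mass(MASS_DA, avg)
    print(f"{avg:.0f} Da/residue -> about {estimate:.0f} residues")
```

Under either assumption the estimate exceeds the 200 amino acid threshold.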

Closure 

While a preferred embodiment of the present invention has been shown and
described, it will be apparent to those skilled in the art that many changes and
modifications may be made without departing from the invention in its broader
aspects. The appended claims are therefore intended to cover all such changes and
modifications as fall within the true spirit and scope of the invention.
(1) GENERAL INFORMATION:

(i) APPLICANT: Brian S. Hooker, et al 

(ii) TITLE OF INVENTION: Method of Producing Human Growth 
Factors From Whole Plants or Plant Cell Cultures 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Paul W. Zimmerman 

(B) STREET: P.O. Box 999 

(C) CITY: Richland 

(D) STATE: WA 

(E) COUNTRY: USA 

(F) ZIP: 99352 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3 1/2 Magnetic Disk 

(B) COMPUTER: IBM compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: WORD97 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/747,246

(B) FILING DATE: 11-12-96 

(C) CLASSIFICATION: unknown 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: N/A 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Paul W. Zimmerman

(B) REGISTRATION NUMBER: 34,761

(C) REFERENCE/DOCKET NUMBER: E-1519

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 509-375-2981 

(B) TELEFAX: 509-375-2592 

(C) TELEX: 
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4481 bp

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: Sense orientation of complementary DNA 
for pro-EGF 

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 5'-AGT GAC TCA GTC GAG ... TTC TCA CTC GTC-3' end

(v) FRAGMENT TYPE: 4.5 kb SmaI/HindIII double strands DNA fragment

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: kidney 

(B) STRAIN: human 

(C) INDIVIDUAL ISOLATE: GI Belle 

(D) DEVELOPMENTAL STAGE: adult 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: fetal human liver library 

(B) CLONE: lambda CH4A; lambda EMBL4; lambda GM1416

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY: human epithelial growth factor cDNA 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: cross-hybridization with mouse cDNA

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



CCCGGGCCAT GCTCCAGCAA AATCAAGCTG TTTTCTTTTG AAAGTTCAAA CTCATCAAGA TT 62 



ATG CTG CTC ACT CTT ATC ATT CTG TTG CCA GTA GTT TCA AAA TTT AGT TTT GTT 116
AGT CTC TCA GCA CCG CAG CAC TGG AGC TGT CCT GAA GGT ACT CTC GCA GGA AAT 170
GGG AAT TCT ACT TGT GTG GGT CCT GCA CCC TTC TTA ATT TTC TCC CAT GGA AAT 224
AGT ATC TTT AGG ATT GAC ACA GAA GGA ACC AAT TAT GAG CAA TTG GTG GTG GAT 278
GCT GGT GTC TCA GTG ATC ATG GAT TTT CAT TAT AAT GAG AAA AGA ATC TAT TGG 332
GTG GAT TTA GAA AGA CAA CTT TTG CAA AGA GTT TTT CTG AAT GGG TCA AGG CAA 386
GAG AGA GTA TGT AAT ATA GAG AAA AAT GTT TCT GGA ATG GCA ATA AAT TGG ATA 440
AAT GAA GAA GTT ATT TGG TCA AAT CAA CAG GAA GGA ATC ATT ACA GTA ACA GAT 494
ATG AAA GGA AAT AAT TCC CAC ATT CTT TTA AGT GCT TTA AAA TAT CCT GCA AAT 548
GTA GCA GTT GAT CCA GTA GAA AGG TTT ATA TTT TGG TCT TCA GAG GTG GCT GGA 602
AGC CTT TAT AGA GCA GAT CTC GAT GGT GTG GGA GTG AAG GCT CTG TTG GAG ACA 656
TCA GAG AAA ATA ACA GCT GTG TCA TTG GAT GTG CTT GAT AAG CGG CTG TTT TGG 710
ATT CAG TAC AAC AGA GAA GGA AGC AAT TCT CTT ATT TGC TCC TGT GAT TAT GAT 764
GGA GGT TCT GTC CAC ATT AGT AAA CAT CCA ACA CAG CAT AAT TTG TTT GCA ATG 818 
TCC CTT TTT GGT GAC CGT ATC TTC TAT TCA ACA TGG AAA ATG AAG ACA ATT TGG 872 
ATA GCC AAC AAA CAC ACT GGA AAG GAC ATG GTT AGA ATT AAC CTC CAT TCA TCA 926 
TTT GTA CCA CTT GGT GAA CTG AAA GTA GTG CAT CCA CTT GCA CAA CCC AAG GCA 980 
GAA GAT GAC ACT TGG GAG CCT GAG CAG AAA CTT TGC AAA TTG AGG AAA GGA AAC 1034 
TGC AGC AGC ACT GTG TGT GGG CAA GAC CTC CAG TCA CAC TTG TGC ATG TGT GCA 1088 
GAG GGA TAC GCC CTA AGT CGA GAC CGG AAG TAC TGT GAA GAT GTT AAT GAA TGT 1142
GCT TTT TGG AAT CAT GGC TGT ACT CTT GGG TGT AAA AAC ACC CCT GGA TCC TAT 1196 
TAC TGC ACG TGC CCT GTA GGA TTT GTT CTG CTT CCT GAT GGG AAA CGA TGT CAT 1250 
CAA CTT GTT TCC TGT CCA CGC AAT GTG TCT GAA TGC AGC CAT GAC TGT GTT CTG 1304 
ACA TCA GAA GGT CCC TTA TGT TTC TGT CCT GAA GGC TCA GTG CTT GAG AGA GAT 1358 
GGG AAA ACA TGT AGC GGT TGT TCC TCA CCC GAT AAT GGT GGA TGT AGC CAG CTC 1412 
TGC GTT CCT CTT AGC CCA GTA TCC TGG GAA TGT GAT TGC TTT CCT GGG TAT GAC 1466
CTA CAA CTG GAT GAA AAA AGC TGT GCA GCT TCA GGA CCA CAA CCA TTT TTG CTG 1520 
TTT GCC AAT TCT CAA GAT ATT CGA CAC ATG CAT TTT GAT GGA ACA GAC TAT GGA 1574 
ACT CTG CTC AGC CAG CAG ATG GGA ATG GTT TAT GCC CTA GAT CAT GAC CCT GTG 1628 
GAA AAT AAG ATA TAC TTT GCC CAT ACA GCC CTG AAG TGG ATA GAG AGA GCT AAT 1682 
ATG GAT GGT TCC CAG CGA GAA AGG CTT ATT GAG GAA GGA GTA GAT GTG CCA GAA 1736 
GGT CTT GCT GTG GAC TGG ATT GGC CGT AGA TTC TAT TGG ACA GAC AGA GGG AAA 1790 
TCT CTG ATT GGA AGG AGT GAT TTA AAT GGG AAA CGT TCC AAA ATA ATC ACT AAG 1844
GAG AAC ATC TCT CAA CCA CGA GGA ATT GCT GTT CAT CCA ATG GCC AAG AGA TTA 1898 
TTC TGG ACT GAT ACA GGG ATT AAT CCA CGA ATT GAA AGT TCT TCC CTC CAA GGC 1952 
CTT GGC CGT CTG GTT ATA GCC AGC TCT GAT CTA ATC TGG CCC AGT GGA ATA ACG 2006 
ATT GAC TTC TTA ACT GAC AAG TTG TAC TGG TGC GAT GCC AAG CAG TCT GTG ATT 2060 
GAA ATG GCC AAT CTG GAT GGT TCA AAA CGC CGA AGA CTT ACC CAG AAT GAT GTA 2114 
GGT CAC CCA TTT GCT GTA GCA GTG TTT GAG GAT TAT GTG TGG TTC TCA GAT TGG 2168 
GCT ATG CCA TCA GTA ATA AGA GTA AAC AAG AGG ACT GGC AAA GAT AGA GTA CGT 2222 
CTC CAA GGC AGC ATG CTG AAG CCC TCA TCA CTG GTT GTG GTT CAT CCA TTG GCA 2276
AAA CCA GGA GCA GAT CCC TGC TTA TAT CAA AAC GGA GGC TGT GAA CAT ATT TGC 2330 
AAA AAG AGG CTT GGA ACT GCT TGG TGT TCG TGT CGT GAA GGT TTT ATG AAA GCC 2384 
TCA GAT GGG AAA ACG TGT CTG GCT CTG GAT GGT CAT CAG CTG TTG GCA GGT GGT 2438
GAA GTT GAT CTA AAG AAC CAA GTA ACA CCA TTG GAC ATC TTG TCC AAG ACT AGA 2492
GTG TCA GAA GAT AAC ATT ACA GAA TCT CAA CAC ATG CTA GTG GCT GAA ATC ATG 2546
GTG TCA GAT CAA GAT GAC TGT GCT CCT GTG GGA TGC AGC ATG TAT GCT CGG TGT 2600 
ATT TCA GAG GGA GAG GAT GCC ACA TGT CAG TGT TTG AAA GGA TTT GCT GGG GAT 2654 
GGA AAA CTA TGT TCT GAT ATA GAT GAA TGT GAG ATG GGT GTC CCA GTG TGC CCC 2708 
CCT GCC TCC TCC AAG TGC ATC AAC ACC GAA GGT GGT TAT GTC TGC CGG TGC TCA 2762 
GAA GGC TAC CAA GGA GAT GGG ATT CAC TGT CTT GAT ATT GAT GAG TGC CAA CTG 2816 
GGG GTG CAC AGC TGT GGA GAG AAT GCC AGC TGC ACA AAT ACA GAG GGA GGC TAT 2870 
ACC TGC ATG TGT GCT GGA CGC CTG TCT GAA CCA GGA AAT AGT GAC TCT GAA TGT 2924 
CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT GTG TGC ATG TAT ATT GAA 2978 
GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC TAC ATC GGG GAG CGA TGT 3032 
CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC CTG ATT TGC CCT GAC TCT ACT 3086
CCA CCC CCT CAC CTC AGG GAA GAT GAC CAC CAC TAT TCC GTA AGA CAC GCT GGC 3140 
CAC GGG CAG CAG CAG AAG GTC ATC GTG GTG GCT GTC TGC GTG GTG GTG CTT GTC 3194 
ATG CTG CTC CTC CTG AGC CTG TGG GGG GCC CAC TAC TAC AGG ACT CAG AAG CTG 3248 
CTA TCG AAA AAC CCA AAG AAT CCT TAT GAG GAG TCG AGC AGA GAT GTG AGG AGT 3302 
CGC AGG CCT GCT GAC ACT GAG GAT GGG ATG TCC TCT TGC CCT CAA CCT TGG TTT 3356 
GTG GTT ATA AAA GAA CAC CAA GAC CTC AAG AAT GGG GGT CAA CCA GTG GCT GGT 3410 
GAG GAT GGC CAG GCA GCA GAT GGG TCA ATG CAA CCA ACT TCA TGG AGG CAG GAG 3464
CCC CAG TTA TGT GGA ATG GGC ACA GAG CAA GGC TGC TGG ATT CCA GTA TCC AGT 3518 
GAT AAG GGC TCC TGT CCC CAG GTA ATG GAG CGA AGC TTT CAT ATG CCC TCC TAT 3572 
GGG ACA CAG ACC CTT GAA GGG GGT GTC GAG AAG CCC CAT TCT CTC CTA TCA GCT 3626 
AAC CCA TTA TGG CAA CAA AGG GCC CTG GAC CCA CCA CAC CAA ATG GAG CTG ACT 3680 
CAG TGA 3686 

AAACTGGAAT TAAAAGGAAA GTCAAGAAGA ATGAACTATG TCGATGCACA GTATCTTTTC 3746

TTTCAAAAGT AGAGCAAAAC TATAGGTTTT GGTTCCACAA TCTCTACGAC TAATCACCTA 3806

CTCAATGCCT GGAGACAGAT ACGTAGTTGT GCTTTTGTTT GCTCTTTTAA GCAGTCTCAC 3866

TGCAGTCTTA TTTCCAAGTA AGAGTACTGG GAGAATCACT AGGTAACTTA TTAGAAACCC 3926 

AAATTGGGAC AACAGTGCTT TGTAAATTGT GTTGTCTTCA GCAGTCAATA CAAATAGATT 3986 

TTTGTTTTTG TTGTTCCTGC AGCCCCAGAA GAAATTAGGG GTTAAAGCAG ACAGTCACAC 404 6 

TGGTTTGGTC AGTTACAAAG TAATTTCTTT GATCTGGACA GAACATTTAT ATCAGTTTCA 4106 

TGAAATGATT GGAATATTAC AATACCGTTA AGATACAGTG TAGGCATTTA ACTCCTCATT 4166 

GGCGTGGTCC ATGCTGATGA TTTTGCCAAA ATGAGTTGTG ATGAATCAAT GAAAAATGTA 4226 
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ATTTAGAAAC TGATTTCTTC AGAATTAGAT GGCCTTATTT TTTAAAATAT TTGAATGAAA 4286 

ACATTTTATT TTTAAAATAT TACACAGGAG GCCTTCGGAG TTTCTTAGTC ATTACTGTCC 4 34 6 

TTTTCCCCTA CAGAATTTTC CCTCTTGGTG TGATTGCACA GAATTTGTAT GTATTTTCAG 4 4 06 

TTACAAGATT GTAAGTAAAT TGCCTGATTT GTTTTCATTA TAGACAACGA TGAATTTCTT 4 4 66 

CTAATTATGA ATTC 4 4 80 
(3) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 783bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE:

(A) DESCRIPTION: sense orientation of five copies of mature EGF concatemers

(iii) HYPOTHETICAL:

(iv) ANTI-SENSE: 5'-CGC GTC AAG GGT ... TCT CAG TGA TAA-3' end

(v) FRAGMENT TYPE: 4.5kb SmaI/HindIII double strands DNA fragment

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: kidney 

(B) STRAIN: human 

(C) INDIVIDUAL ISOLATE: Z.Dai, et al. 
(D) DEVELOPMENTAL STAGE: adult

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: fetal human liver library 

(B) CLONE: lambda CH4A; lambda EMBL 4 ; lambda GM1416 

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY: Concatemer of mature EGF fragment without linker

(B) LOCATION:

(C) IDENTIFICATION METHOD: PCR cloning

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL:

(D) VOLUME:

(E) ISSUE:

(F) PAGES:

(G) DATE:

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES IN SEQ ID NO:

FROM (position) TO (position)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:



AAT AGT GAC TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT 54

GTG TGC ATG TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC 108

TAC ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC AAT 162

AGT GAC TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT GTG 216

TGC ATG TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC TAC 270

ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC AAT AGT 324

GAC TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT GTG TGC 378

ATG TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC TAC ATC 432

GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC AAT AGT GAC 486

TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT GTG TGC ATG 540

TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC TAC ATC GGG 594 

GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC AAT AGT GAC TCT 648

GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT GTG TGC ATG TAT 702 

ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC TAC ATC GGG GAG 756 

CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC 795 
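
The repeat structure of SEQ ID NO: 2 can be checked arithmetically. The following is a minimal Python sketch and is not part of the original listing; the names EGF_UNIT and concatemer_stats are hypothetical, and the 159-nucleotide unit is transcribed from the first 53 codons shown above. Under those assumptions it counts the nucleotides and codons in five tandem copies and checks for in-frame stop codons.

```python
# Illustrative sketch only; names are hypothetical, not from the listing.
# One mature EGF unit: the first 53 codons (159 nt) of SEQ ID NO: 2.
EGF_UNIT = "".join("""
AAT AGT GAC TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT
GTG TGC ATG TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC
TAC ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC
""".split())

STOP_CODONS = {"TAA", "TAG", "TGA"}


def concatemer_stats(unit: str, copies: int) -> dict:
    """Return nucleotide length, codon count, and an in-frame stop check."""
    seq = unit * copies
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return {
        "nt_length": len(seq),
        "codons": len(codons),
        "has_internal_stop": any(c in STOP_CODONS for c in codons),
    }


if __name__ == "__main__":
    print(len(EGF_UNIT))
    print(concatemer_stats(EGF_UNIT, 5))
```

Five copies of the 159-nucleotide unit give 795 nucleotides, which agrees with the final coordinate printed in the listing, and 265 codons, which is above the 200 amino acid length recited in the claims.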
(4) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: sense orientation concatemer of mature EGF fragments with linkers

(iii) HYPOTHETICAL:

(iv) ANTI-SENSE: 5'-GTC CAG AGC ... CAG TGA TAA-3' end

(v) FRAGMENT TYPE: 5 copies of 159bp concatemer mature EGF linked with linkers

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: kidney 

(B) STRAIN: human 

(C) INDIVIDUAL ISOLATE: Z. Dai, et al 

(D) DEVELOPMENTAL STAGE: adult 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY: concatemer of mature EGF linked with linkers 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: PCR cloning 

(D) OTHER INFORMATION: Cleavage sites at 142-165, 307-331, 

465-489, 631-655. 

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL:

(D) VOLUME:

(E) ISSUE:

(F) PAGES:

(G) DATE:

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES IN SEQ ID NO:

FROM (position) TO (position)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



AAT AGT GAC TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT 54

GTG TGC ATG TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC 108

TAC ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC GGC 162

GGA AGA GTT AAC TGC ATG CAG AAT AGT GAC TCT GAA TGT CCC CTG TCC CAC GAT 216

GGG TAC TGC CTC CAT GAT GGT GTG TGC ATG TAT ATT GAA GCA TTG GAC AAG TAT 270

GCA TGC AAC TGT GTT GTT GGC TAC ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG 324

AAG TGG TGG GAA CTG CGC GGC GGA AGA GTT AAC TGC ATG CAG AAT AGT GAC TCT 378

GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT GTG TGC ATG TAT 432

ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC TAC ATC GGG GAG 486

CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC GGC GGA AGA GTT AAC 540

TGC ATG CAG AAT AGT GAC TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC 594

CAT GAT GGT GTG TGC ATG TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT 648

GTT GTT GGC TAC ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA 702

CTG CGC GGC GGA AGA GTT AAC TGC ATG CAG AAT AGT GAC TCT GAA TGT CCC CTG 756

TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT GTG TGC ATG TAT ATT GAA GCA TTG 810

GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC TAC ATC GGG GAG CGA TGT CAG TAC 864

CGA GAC CTG AAG TGG TGG GAA CTG CGC 891
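
The relationship between SEQ ID NO: 3 and SEQ ID NO: 2 can be illustrated with a short Python sketch, which is not part of the original listing. The names EGF_UNIT, LINKER and build_linked_concatemer are hypothetical; the unit and the 8-codon spacer are transcribed from the sequence above, on the assumption that the same 24-nucleotide spacer separates each pair of adjacent EGF units.

```python
# Illustrative sketch only; names are hypothetical, not from the listing.
# One mature EGF unit: 53 codons (159 nt), as in SEQ ID NO: 2 and NO: 3.
EGF_UNIT = "".join("""
AAT AGT GAC TCT GAA TGT CCC CTG TCC CAC GAT GGG TAC TGC CTC CAT GAT GGT
GTG TGC ATG TAT ATT GAA GCA TTG GAC AAG TAT GCA TGC AAC TGT GTT GTT GGC
TAC ATC GGG GAG CGA TGT CAG TAC CGA GAC CTG AAG TGG TGG GAA CTG CGC
""".split())

# The 8-codon (24 nt) spacer found between adjacent units in SEQ ID NO: 3.
LINKER = "".join("GGC GGA AGA GTT AAC TGC ATG CAG".split())


def build_linked_concatemer(unit: str, linker: str, copies: int) -> str:
    """Join `copies` units with one linker between each adjacent pair."""
    return linker.join([unit] * copies)


if __name__ == "__main__":
    linked = build_linked_concatemer(EGF_UNIT, LINKER, 5)
    # 5 units and 4 linkers: 5 * 159 + 4 * 24 nucleotides.
    print(len(linked))
    # Removing the linkers leaves the linker-free concatemer.
    print(linked.replace(LINKER, "") == EGF_UNIT * 5)
```

Five units and four spacers give 5 x 159 + 4 x 24 = 891 nucleotides, matching the final coordinate of the listing; with the spacers removed, the remainder is the 795-nucleotide linker-free concatemer.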
(5) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330bp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double strands 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: upstream enhancer (from -343 to 
-90 bp) of 35S promoter 

(iii) HYPOTHETICAL: 



(iv) ANTI-SENSE:

(v) FRAGMENT TYPE: 253bp upstream of 35S promoter enhancer element

(vi) ORIGINAL SOURCE:

(A) ORGANISM: cauliflower mosaic virus (CaMV)

(B) STRAIN: Cabb B-D

(C) INDIVIDUAL ISOLATE: Z.Dai, et al

(D) DEVELOPMENTAL STAGE:

(E) HAPLOTYPE:

(F) TISSUE TYPE:

(G) CELL TYPE:

(H) CELL LINE:

(I) ORGANELLE:



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY: 35S promoter B-domain enhancer

(B) LOCATION: 

(C) IDENTIFICATION METHOD: standard cloning 

(D) OTHER INFORMATION: B-domain of 35S promoter from EcoR V 
site to Hind II site (upstream enhancer region from -343 to -90 bp) 

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL:

(D) VOLUME:

(E) ISSUE:

(F) PAGES:

(G) DATE:

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES IN SEQ ID NO:

FROM (position) TO (position)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

GTCAACATGG TGGAGCACGA CACACTTGTC TACTCCAAAA ATATCAAAGA TACAGTCTCA 60 
GAAGACCAAA GGGCAATTGA GACTTTTCAA CAAAGGGTAA TATCCGGAAA CCTCCTCGGA 120 
TTCCATTGCC CAGCTATCTG TCACTTTATT GTGAAGATAG TGGAAAAGGA AGGTGGCTCC 180 
TACAAATGCC ATCATTGCGA TAAAGGAAAG GCCATCGTTG AAGATGCCTC TGCCGACAGT 240 
GGTCCCAAAG ATGGACCCCC ACCCACGAGG AGCATCGTGG AAAAAGAAGA CGTTCCAACC 300 
ACGTCTTCAA AGCAAGTGGA TTGATGTGAT 330 
(6) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 5'-untranscription region of
chl a/b binding protein

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 11kb EcoRI fragment

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: whole plants 

(B) STRAIN: Arabidopsis 

(C) INDIVIDUAL ISOLATE: Ha et al 

(D) DEVELOPMENTAL STAGE: 30 day old seedlings

(E) HAPLOTYPE:

(F) TISSUE TYPE:

(G) CELL TYPE:

(H) CELL LINE:

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: genomic DNA

(B) CLONE: lambda bAT1005

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY: arabidopsis

(B) LOCATION:

(C) IDENTIFICATION METHOD: cross-hybridization

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL:

(D) VOLUME:

(E) ISSUE:

(F) PAGES:

(G) DATE:

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES IN SEQ ID NO:

FROM (position) TO (position)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

GAATTCATCA ACAAATTACT CCTCAATCAC ACTCCTATAG AAAACGGTTT AAGCTATCAT 60 
TACATGTCTA GTTGGTTTTA CTCAGCCCTA GAAGTGTTGT TTATTGCATC ACTTTCCACG 120 
AAGCACAATT TTTCTTTTTT ACAATCACTA GACCTCACAG GCTCACACAT ATGCTTTAGA 180 
GCACATTCTA AACTTTGAAC TATAAAAGCT GTTAACACTA ATACACTATG CGTTCTTTTT 240
TGCTCCAAAC ACTTTTGATC CATTATTAGG AGACACTCCA CTTAGAAAGA TTTTCTAATC 300 
CTTTGGTCAA CTAGGAAGTT CAAGGTTTTT CTAAACAGAA ATTCATTTCA CAAGTAATTT 360 
AATTTATAAG GAAATGAATA GAGAAATCAA ATCATTGAAG AACTACAAAA TATAGATTCA 420 
AGGTCAGGTC TAAGAAAATA TTCCTGAAGC TCAAAAAAGA GTTTTCCTCT CACATTATAG 480 
AATTGGCCTT TACTTCAACA TTTTCCCACC TATTCCACAT TTGGTCAGAA CATTTTTAAT 540 
TACTTGTGGA TCAATTTCCG GTTGAAATGG GTTTGGTGAA TATCCGGTTC AGTTATATGG 600 
TGGCCGTTGG AATTGGCTTA TTAGTTGTGG CCGTTGTTGA AGCCGTTGGT ATTGGTAAGG 660 
GAGAAGCAGA CTTGTGGCTA TGAGTCTATG ACCATGACTC GTGATTATGG AGCTGTCTTA 720 
TGACCCTGAC CATCACCTTG ATCTGGTGGA TTCCAATGTT TTCTTCTTCT TCTAATAAAA 780 
TATTATGGTC AATACAGGTG CTAATTAAGA TGGTAATAAT TTCTTATGTT TCTGTGGTAA 840
AGTTTGATTC AATTCCGTAG TTTTAGATAA TCTTATTTCC ATACATAAAT TTTATAGTTT 900 
TATCTACTTT GTTCTTATGT TTTATCTCTA GCCAAGAGTT ATTATTATTA TCAGAAGAAG 960 
AAAAAAAAAA GAAGCATATA TACAAAAGGT TTAATAAAAT GTATTATACA AGGCAATTAT 1020
CCAAATTTTT TTTGTTTTGG TTTACATTGA TGCTCTCAGG ATTTCATAAG GATAGAGAGA 1080
TCTATTCGTA TACGTGTCAC GTCATGAGTG GGTGTTTCGC CAATCCATGA AACGCACCTA 1140 
GATATCTAAA ACACATATCA ATTGCGAATC TGCGAAGTGC GAGCCATTAA CCACGTAAGC 1200 
AAACAAACAA TCTAAACCCC AAAAAAAATC TATGACTAGC CAATAGCAAC CTCAGAGATT 1260 
GATATTTCAA GATAAGACAG TATTTAGATT TCTGTATTAT ATATAGCGAA AATCGCATCA 1320 
ATACCAAACC ACCCATTTCT TGGCTTACAA CAACAAATCT TAAACGTTTT ACTTTGTGCT 1380 
GCACTACTCA ACCTTAATGG CCGCCTCAAC AATGGCTCTC TCCTCCCCTG CCTTCGCCGG 1440 
T 1441 
(7) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832bp

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: double strands

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE:

(A) DESCRIPTION: CaMV 35S 5'-untranscription upstream

(iii) HYPOTHETICAL:

(iv) ANTI-SENSE:

(v) FRAGMENT TYPE: AluI (from 7143bp)-EcoRI (to 7517bp)

(vi) ORIGINAL SOURCE:

(A) ORGANISM: cauliflower mosaic virus

(B) STRAIN: CM4-184

(C) INDIVIDUAL ISOLATE: RJ Shepherd

(D) DEVELOPMENTAL STAGE:

(E) HAPLOTYPE:

(F) TISSUE TYPE:

(G) CELL TYPE:

(H) CELL LINE:

(I) ORGANELLE:

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: genomic library of CM4-184

(B) CLONE: pOS-1

(viii) POSITION IN GENOME:

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY: CaMV 35S promoter

(B) LOCATION:

(C) IDENTIFICATION METHOD: cross-hybridization

(D) OTHER INFORMATION:


(x) PUBLICATION INFORMATION: 

(A) AUTHORS: 

(B) TITLE: 

(C) JOURNAL: 

(D) VOLUME: 

(E) ISSUE: 

(F) PAGES: 

(G) DATE: 

(H) DOCUMENT NUMBER: 

(I) FILING DATE: 

(J) PUBLICATION DATE: 

(K) RELEVANT RESIDUES IN SEQ ID NO: 

FROM (position) TO (position) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

CCCACA GATGGTTAGA GAGGCTTACG CAGCAGGTCT CATCAAGACG ATCTACCCGA 56
GCAATAATCT CCAGGAAATC AAATACCTTC CCAAGAAGGT TAAAGATGCA GTCAAAAGAT 116
TCAGGACTAA CTGCATCAAG AACACAGAGA AAGATATATT TCTCAAGATC AGAAGTACTA 176
TTCCAGTATG GACGATTCAA GGCTTGCTTC ACAAACCAAG GCAAGTAATA GAGATTGGAG 236
TCTCTAAAAA GGTAGTTCCC ACTGAATCAA AGGCCATGGA GTCAAAGATT CAAATAGAGG 296
ACCTAACAGA ACTCGCCGTA AAGACTGGCG AACAGTTCAT ACAGAGTCTC TTACGACTCA 356
ATGACAAGAA GAAAATCTTC GTCAACATGG TGGAGCACGA CACACTTGTC TACTCCAAAA 416
ATATCAAAGA TACAGTCTCA GAAGACCAAA GGGCAATTGA GACTTTTCAA CAAAGGGTAA 476
TATCCGGAAA CCTCCTCGGA TTCCATTGCC CAGCTATCTG TCACTTTATT GTGAAGATAG 536
TGGAAAAGGA AGGTGGCTCC TACAAATGCC ATCATTGCGA TAAAGGAAAG GCCATCGTTG 596
AAGATGCCTC TGCCGACAGT GGTCCCAAAG ATGGACCCCC ACCCACGAGG AGCATCGTGG 656
AAAAAGAAGA CGTTCCAACC ACGTCTTCAA AGCAAGTGGA TTGATGTGAT ATCTCCACTG 716
ACGTAAGGGA TGACGCACAA TCCCACTATC CTTCGCAAGA CCCTTCCTCT ATATAAGGAA 776
GTTCATTTCA TTTGGAGAGA ACACGGGGGA CTCTAGAGGA TCCCCGGGTG GTCAGT 832
(8) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 473bp 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: double strands 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: 

(A) DESCRIPTION: 5'-untranscription upstream of
ribosomal protein L34

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 1500bp BamHI-HindIII

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: tobacco NT1 cells 

(B) STRAIN: 

(C) INDIVIDUAL ISOLATE: Z.Dai, et al 

(D) DEVELOPMENTAL STAGE: 3 days old 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: NT1 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: genomic library 

(B) CLONE: TSC40

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT: 

(B) MAP POSITION: 

(C) UNITS: 

(ix) FEATURE: 

(A) NAME/KEY: RPL-34 promoter 

(B) LOCATION: 

(C) IDENTIFICATION METHOD: plaque hybridization 

(D) OTHER INFORMATION: 

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL:

(D) VOLUME:

(E) ISSUE:

(F) PAGES:

(G) DATE:

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES IN SEQ ID NO:

FROM (position) TO (position)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

AGATCTCT CTTTGTATTC TTATTGATGT ACTGGTTTGA AGATGAATAA AATCTTTCAT 58 
TCCACCAAAA AAAGAATGAA AATAAAATTT TAATATACAT GTTGATATAG ACAAAGAAGA 118 
AAAAAAAAGT TGTGATTACA TTTATTGACT ATTTGATGCC AATATCTATA ACTAGAGCTA 178 
TTTTCTATCA ATTATATGGG TATGTTGTTA TACCATGCCA AAACCTCAAT TCATAATGTG 238 
CTTGTTTAAA CCCAGTTTAA TGGGCTAACA TGTTGATGGG CTTATAGGCC CGTCTGATTT 298 
CCTTGCCAGA CACTAGTAAG TAAATGATTC TATCATCCAA TATCAACCGT GGGATCTAGG 358 
GCTTGTCCCA CTTATATACA CTACATATAT TTAACTTTCC TTTAGCCCTT CTGCTTCAGC 418 
CCCCAAAACA AAGAAAGAAG CTACAGAGAG AATAGCAGCG CCGCCGTGAA AAATG 473 
(9) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1162bp

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: double strands

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE:

(A) DESCRIPTION: 5'-untranscription region of 35S gene
from CaMV with 2 copies of B domains

(iii) HYPOTHETICAL: 

(iv) ANTI-SENSE: 

(v) FRAGMENT TYPE: 253bp HindIII/EcoRV fragment + 343bp HindII/EcoRI fragment

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: whole cell 

(B) STRAIN: CM4-184 

(C) INDIVIDUAL ISOLATE: Z.Dai, et al 

(D) DEVELOPMENTAL STAGE: 

(E) HAPLOTYPE: 

(F) TISSUE TYPE: 

(G) CELL TYPE: 

(H) CELL LINE: 

(I) ORGANELLE: 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: genomic library of CM4-184 

(B) CLONE: POS-1 

(viii) POSITION IN GENOME: 

(A) CHROMOSOME/SEGMENT:

(B) MAP POSITION:

(C) UNITS:

(ix) FEATURE:

(A) NAME/KEY: CaMV 35S promoter with duplication of upstream B domain

(B) LOCATION:

(C) IDENTIFICATION METHOD:

(D) OTHER INFORMATION:

(x) PUBLICATION INFORMATION:

(A) AUTHORS:

(B) TITLE:

(C) JOURNAL:

(D) VOLUME:

(E) ISSUE:

(F) PAGES:

(G) DATE:

(H) DOCUMENT NUMBER:

(I) FILING DATE:

(J) PUBLICATION DATE:

(K) RELEVANT RESIDUES IN SEQ ID NO:

FROM (position) TO (position)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

CCC ACAGATGGTT AGAGAGGCTT ACGCAGCAGG TCTCATCAAG ACGATCTACC 53
CGAGCAATAA TCTCCAGGAA ATCAAATACC TTCCCAAGAA GGTTAAAGAT GCAGTCAAAA 113
GATTCAGGAC TAACTGCATC AAGAACACAG AGAAAGATAT ATTTCTCAAG ATCAGAAGTA 173
CTATTCCAGT ATGGACGATT CAAGGCTTGC TTCACAAACC AAGGCAAGTA ATAGAGATTG 233
GAGTCTCTAA AAAGGTAGTT CCCACTGAAT CAAAGGCCAT GGAGTCAAAG ATTCAAATAG 293
AGGACCTAAC AGAACTCGCC GTAAAGACTG GCGAACAGTT CATACAGAGT CTCTTACGAC 353
TCAATGACAA GAAGAAAATC TTCGTCAACA TGGTGGAGCA CGACACACTT GTCTACTCCA 413
AAAATATCAA AGATACAGTC TCAGAAGACC AAAGGGCAAT TGAGACTTTT CAACAAAGGG 473
TAATATCCGG AAACCTCCTC GGATTCCATT GCCCAGCTAT CTGTCACTTT ATTGTGAAGA 533
TAGTGGAAAA GGAAGGTGGC TCCTACAAAT GCCATCATTG CGATAAAGGA AAGGCCATCG 593
TTGAAGATGC CTCTGCCGAC AGTGGTCCCA AAGATGGACC CCCACCCACG AGGAGCATCG 653
TGGAAAAAGA AGACGTTCCA ACCACGTCTT CAAAGCAAGT GGATTGATGT GATAACATGG 713
TGGAGCACGA CACACTTGTC TACTCCAAAA ATATCAAAGA TACAGTCTCA GAAGACCAAA 773
GGGCAATTGA GACTTTTCAA CAAAGGGTAA TATCCGGAAA CCTCCTCGGA TTCCATTGCC 833
CAGCTATCTG TCACTTTATT GTGAAGATAG TGGAAAAGGA AGGTGGCTCC TACAAATGCC 893
ATCATTGCGA TAAAGGAAAG GCCATCGTTG AAGATGCCTC TGCCGACAGT GGTCCCAAAG 953
ATGGACCCCC ACCCACGAGG AGCATCGTGG AAAAAGAAGA CGTTCCAACC ACGTCTTCAA 1013
AGCAAGTGGA TTGATGTGAT ATCTCCACTG ACGTAAGGGA TGACGCACAA TCCCACTATC 1073
CTTCGCAAGA CCCTTCCTCT ATATAAGGAA GTTCATTTCA TTTGGAGAGA ACACGGGGGA 1133
CTCTAGAGGA TCCCCGGGTG GTCAGT 1159
CLAIMS

We claim: 

1. A method of producing human growth factors from plant cells, 
comprising the steps of:

(a) obtaining a positive transformant of the plant cells, the positive 
transformant carrying genetic material encoding the production of a human growth 
factor with a length of at least 200 amino acids; 

(b) cultivating the positive transformant; and 
(c) obtaining the human growth factors.

2. The method as recited in claim 1, wherein obtaining the positive 
transformant has the step of: 

modifying a chimeric cDNA encoding the human growth factor with a 
length of at least 200 amino acids, for subcloning into a plant expression vector.

3. The method as recited in claim 2, further comprising the steps of: 

(a) subcloning the chimeric cDNA into the plant expression vector 
and obtaining a subcloned plant expression vector; 
(b) transferring the subcloned plant expression vector into a
plurality of plant cells;

(c) selecting a plurality of positive transformants from the plurality of 
plant cells on an antibiotic selective media; 

(d) permitting growth of the portion of the plurality of plant cells in 
whole plants or suspensions; and

(e) extracting a liquid containing the human growth factor from the 
plurality of transgenic plant cells. 



4. The method as recited in claim 3, wherein transferring is by direct 
particle bombardment.






5. The method as recited in claim 3, wherein transferring is by 
Agrobacterium mediated transformation. 

6. The method as recited in claim 5, wherein Agrobacterium mediated 
transformation comprises the steps of:

(a) placing the subcloned plant expression vector into an
Agrobacterium;

(b) co-cultivating the Agrobacterium containing the subcloned 
plant expression vector with the plurality of plant cells. 


7. The method as recited in claim 1, wherein the step of cultivating is 
with a whole plant. 

8. The method as recited in claim 1, wherein the step of cultivating is 
with a plant tissue culture.

9. The method as recited in claim 1, wherein the step of obtaining is 
selected from the group consisting of ultrafiltration, affinity chromatography, and 
electrophoresis. 


10. The method as recited in claim 1, wherein the length of at least 200 
amino acids is obtained by cloning a cDNA. 

11. The method as recited in claim 10, wherein said cDNA is a
pre-pro-EGF cDNA.

12. The method as recited in claim 1, wherein the length of at least 200 
amino acids is obtained by synthesizing a cDNA. 
13. The method as recited in claim 12, wherein said synthesizing is
concatemerizing multiple gene copies to obtain the length of at least 200 amino
acids.

14. The method as recited in claim 1, further comprising increasing an
overall size of a gene to be expressed with a fusion construct encoding an hEGF
linked to a protein that is efficiently produced in plant systems.

15. The method as recited in claim 1, wherein said human growth factor
is selected from the group consisting of epidermal growth factor (EGF),
transforming growth factor (TGF), vascular endothelial growth factor (VEGF),
platelet-derived growth factor (PDGF), fibroblast growth factor (FGF), tumor
necrosis factor (TNF), heparin-binding epidermal growth factor (HBEGF),
insulin-like growth factor (ILGF), platelet-derived endothelial cell growth factor
(PDECGF), platelet-derived angiogenesis factor (PDAF), and bone-and-cartilage
inducing growth factor (BCIF) and combinations thereof.

16. The method as recited in claim 2, wherein modifying is by adding a
regulatory element selected from the group consisting of leader sequences, signal
peptides, transcription promoters or enhancers, and transcription terminators.

17. The method as recited in claim 2, wherein modifying a chimeric
cDNA comprises the steps of:

(a) adding said transcription promoter to the upstream or 5' end of
the chimeric cDNA; and

(b) adding said transcription terminator to the downstream or 3' 
end of the chimeric cDNA. 
18. The method as recited in claim 17, further comprising adding an
additional regulatory element encoding a signal peptide, said additional regulatory
element added between the transcription promoter and the upstream 5' end of the
chimeric cDNA.

19. The method as recited in claim 18, further comprising adding a 
regulatory element between the transcription promoter and the additional 
regulatory element encoding the signal peptide to enhance mRNA stability. 

20. The method as recited in claim 18, further comprising adding a 
regulatory element at the downstream or 3' end of the chimeric cDNA to enhance 
mRNA stability. 

21. The method as recited in claim 17, wherein transcription promoters 
limit growth factor production to a non-crop portion of a transgenic whole plant.

22. The method as recited in claim 21, wherein the transcription promoters 
are selected from the group consisting of an upstream enhancer region (-343 to -90 
bp) of a CaMV 35S promoter, a chlorophyll a/b binding promoter (cabl) and 
combinations thereof. 

23. The method as recited in claim 17, wherein the transcription promoters 
are selected from the group consisting of a modified 35S promoter, TSC29 
promoter, TSC40 promoter and combinations thereof. 

24. The method as recited in claim 23, wherein the modified 35S 
promoter is a 35S promoter modified by duplicating an upstream enhancer region
(-343 to -90 bp) of the 35S promoter to increase transcription activity. 

25. The method as recited in claim 2, wherein said cDNA is a
pre-pro-EGF cDNA.

26. The method as recited in claim 25, wherein said pre-pro-EGF cDNA
has approximately 4.5 kb, whereby overall titers of active hEGF in both whole 
plants and cell culture are increased. 

27. The method as recited in claim 2, wherein the length of at least 200 
amino acids is obtained by synthesizing the cDNA. 

28. The method as recited in claim 27, wherein said synthesizing is 
concatemerizing multiple gene copies to obtain the length of at least 200 amino
acids. 

29. The method as recited in claim 28, wherein said multiple gene copies 
are an oligomeric polypeptide having repeated hEGF domains.

30. The method as recited in claim 2, further comprising increasing an
overall size of a gene to be expressed with a fusion construct encoding an hEGF 
linked to a protein that is efficiently produced in plant systems. 